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Abstract 

We present a novel polynomial time approximation scheme for two-strategy anonymous 
games, in which the players' utility functions, although potentially different, do not differentiate 
among the identities of the other players. Our algorithm computes an e-approximate Nash 
equilibrium of an n-player 2-strategy anonymous game in time poly(n) • which 
significantly improves upon the running time vP'^^l'^ ^ required by the algorithm of Daskalakis 
& Papadimitriou, 2007. The improved running time is based on a new structural understanding 
of approximate Nash equilibria: We show that, for any e, there exists an e-approximate Nash 
equilibrium in which either only 0(l/e"^) players randomize, or all players who randomize use 
the same mixed strategy. To show this result we employ tools from the literature on Stein's 
Method. 

1 Introduction 

It has been recently estabhshed that computing a Nash equilibrium is an intractable problem [18^ 
[TOl [6l [13], even in the case of two-player games [7]. In view of this hardness result, research has 
been directed towards the computation of approximate Nash equilibria, which are states of the 
game in which no player has more than some small e incentive to change her strategy. But, despite 
much research in this direction [221 l2Tl [TTl Wl\ [T2t [5l [27], only constant e's can be achieved in 
polynomial time. Yet, an approximate Nash equilibrium in which the players have regret equal 
to a significant fraction of their payoffs is not an attractive solution concept; after all, there is no 
reason to expect a player to keep her strategy if she can significantly improve by changing to a 
different one. On the contrary, if e were arbitrarily small, it could be that the cost of switching one's 
strategy is larger than the regret e that she suffers. Hence, approximate equilibria with arbitrarily 
close approximation could be credible solutions concepts. The following question then emerges: Is 
there a Polynomial Time Approximation Scheme for approximate Nash equilibria? 

The question remains open for general games, but there are special classes known to be tractable. 
It is well-known, for example, that zero-sum games are solvable exactly in polynomial time by Linear 
Programming \24:\ [9]. This tractability result has been extended to a generalization of zero-sum 
games, called two-player low-rank games, in which the sum of the players' payoff tables has fixed 
rank; in this case there is a PTAS for approximate Nash equilibria. It has also been shown that 
symmetric multi-player games with (about logarithmically) few strategies per player can be solved 
exactly in polynomial time by a reduction to the theory of real closed fields [25j. In congestion 
games, we can compute in polynomial time a pure Nash equilibrium, if the game is a symmetric 
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network congestion game |16] . and an approximate pure Nash equilibrium, if the congestion game 
is symmetric (but not necessarily network) and the utilities are somehow "continuous" [8j . 

In this paper, we consider another important class of games, called anonymous. These are 
games in which each player's utility function does not differentiate among the identities of the 
other players. That is, the payoff of a player depends on the strategy that she chooses and only 
the number of other players choosing each strategy. Anonymous games comprise a broad and well 
studied class of games (see, e.g., [U IH [I9l |23] for recent work on this subject by economists) which 
are of special interest to the Algorithmic Game Theory community, as they capture important 
aspects of auctions and markets, as well as of Internet congestion. 

But, what do we know about computing Nash equilibria in anonymous games? It was recently 
established that there is a PTAS for the case of a constant number of strategies per player [141 115j . 
The running time of the algorithm given in [15] is n'-^(-f(^'^/^)\ where e is the desired approximation, 
s the number of strategies available to the players, and / some function which is polynomial in 
1/e, but superpolynomial in s. Hence, although theoretically efficient for any fixed e and s, the 
algorithm is highly non-practical. Even for the simpler case of two-strategy anonymous games the 
running time achieved by [13| is n*^^^/*^ ^ . 

In this paper, we present a more efficient algorithm for 2-strategy anonymous games, which 
runs in time poly(n) • (l/e)*^^^/*^ \ The improved running time is due to a novel understanding of 
certain structural properties of approximate Nash equilibria. In particular, we show that, for any 
integer k, there exists an e-approximate Nash equilibrium, with e = 0{l/k), in which 

(a) either at most = 0((l/e)^) players use randomized strategies, and their strategies are 



integer multiples of 1/A;^ ; 

(b) or all players who randomize choose the same mixed strategy which is also an integer multiple 
of FT- 

To derive the above characterization, we study mixed strategy profiles in the proximity of a Nash 
equilibrium. We establish that there always exists a nearby mixed strategy profile which is of one 
of the types (a) or (b) described above and satisfies the Nash equilibrium conditions to within 
an additive e, thus corresponding to an e-approximate equilibrium. Given this structural result 
(see Theorem 13. ip . an e-approximate equilibrium can be found by dynamic programming (see 
Theorem [321). 

We feel that a more sophisticated analysis can establish similar structural properties for ap- 
proximate Nash equilibria in multi-strategy anonymous games, extending our efficient PTAS to 
anonymous games with any fixed number of strategies. 

Overview of Techniques: Let us track down the effect in the Nash equilibrium resulting by replac- 
ing a mixed Nash equilibrium (pi, . . . ,Pn) £ [0, 1]" by another strategy proffie (qi, . . . , qn) € [0, l]", 
where the probabilities pi and qi correspond to the mixed strategy of player i in the two strategy 
profiles. It is not hard to see that the approximation achieved by the strategy profile {qi, . . . , qn) can 
be, loosely speaking, bounded by the total variation distance between the distribution of the sum 
of n Bernoulli random variables with expectations pi, . . . ,pn and that of another sum of Bernoulli 
random variables with expectations qi, . . . , qn- Hence, to establish the structural property described 
above it is sufficient to show that given any set of probability values (pi, . . . ,Pn) there is another set 
(qi,. . ■ ,qn) which satisfies either Property (a) or Property (b) and is such that the total variation 
distance between the two sums of Bernoulli random variables with expectations {pi}i and {qi}i 
respectively is at most e. 

^Note that, since every player has 2 strategies, a mixed strategy is a number in [0, 1]. 
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To give some insight into the construction of the set {qi}i, let us consider the following scenarios 
for an integer k: 

(i) at least of the pj's fall in the set 1 — 1/k] and the others are either or 1; 

(ii) at most k^ of the pi's fall in the set 1 — 1/k] and the others are either or 1; 

Let us consider Case (i) first. It is reasonable to expect, by the Central Limit Theorem, or finitary 
versions thereof, that the sum of at least k^ Bernoulli random variables with expectations from the 
set [l/fc, 1 — 1/k] is close in total variation distance to a Normal distribution, with the appropriate 
mean and variance, and, hence, to a Binomial distribution which approximates that Normal distri- 
bution. In Section 16.11 we show that this is indeed the case, even if we only allow the probability 
of the Binomial distribution to be an integer multiple of 1/kn. Hence, the sum of the original 
Bernoulli random variables with expectations from the set 1 — 1/k] can be approximated by 
another set of Bernoulli random variables which all share the same mean, which, moreover, is an 
integer multiple of 1/kn. Hence, an approximate equilibrium satisfying Property (b) above can be 
defined. 

In the Case (ii) , approximating by a Normal distribution is not tight enough to give overall total 
variation distance of 0{l/k). We resort instead to the following structural result shown in [14j: 
Given any set of Bernoulli random variables with expectations pi, . . . ,pn, there is a way to round 
the probabilities to multiples of for any k, so that the distribution of the sum of these n 

variables is affected by an additive 0{l/k) in total variational distance. Hence, an approximate 
equilibrium satisfying Property (a) above can be defined (see Section [6^ . 

It remains to argue that the Cases (i) and (ii) are general enough. For this, we describe an 
iterative procedure which alters the values of those pi's falling outside the set {0, 1} U 1 — 
in such a way that, in the end of this procedure, all pj's do fall in the set {0, 1} U [1/k, 1 — 1/k] 
and the distribution of the sum of Bernoulli random variables with expectations pi does not change 
by more than 0{l/k) in total variation distance. Let us, e.g., consider the pi's falling in the set 
(0, 1/A;) and round some of them to and some of them to 1/ A; so that their sum is approximated to 
within 0{l/k). By the law of rare events (the precise result we use is from the literature on Poisson 
approximations, see, e.g., [1]), the sum of Bernoulli's before and after the rounding is distributed 
like two Poisson distributions with means equal to the sum of pi's before and after the rounding 
respectively. Since these means are within 0{l/k), the two Poisson distributions are within 0{l/k) 
in total variation distance and so are the sums of the Bernoulli's before and after the rounding. 
For details see Section [5l 

2 Definitions and Notation 

A game has n players, 1, . . . , n, and t strategies, 1, 2, . . . , t, available to them, so that each player 
gets some payoff for every selection of strategies by her and the other players. The game is called 
anonymous, if the payoff of each player depends on her strategy and only the number, but not the 
identities, of the other players who choose each of the t strategies. 

In this paper, we study two-strategy anonymous games. In thesegames, the payoff function of 
each player i is specified by giving u\,u\ : {0, 1, . . . , n — 1} — > [0, !]□ so that u\(ra) is the payoff 
of i, if she chooses strategy s and m of the other players choose strategy 2. Hence, the game is 
succinctly representable [25], in the sense that its representation requires 2?!^ numbers, as opposed 

^In the literature on Nash approximation, utilities are usually normalized in this way so that the approximation 
error is additive. 
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to the (exponential in the number of players) nt^ numbers required for general games. Arguably, 
succinct games are the only multiplayer games that are computationally meaningful (see [25] for 
an extensive discussion of this point). 

A mixed strategy profile is a set of n probability values pi,P2, ■ ■ ■ ,Pn £ [0, 1], corresponding to 
the probability with which each player chooses strategy 2. A mixed strategy profile is an e-Nash 
equilibrium if, for all i € [n], the following hold 



where for the purposes of the expectation x is drawn from {0, . . . , n — 1} by tossing n—1 independent 
coins with probabilities {pjl^yj. That is, a mixed strategy profile is an e-Nash equilibrium if every 
player is only randomizing among strategies which, when played against the mixed strategies of the 
other players, achieve expected payoff within (additive) e from the expected payoff achieved by the 
best strategy. 

The notion of e-Nash equilibrium is closely related to the notion of e-approximate Nash equilib- 
rium, defined as any mixed strategy profile in which no player can improve his expected payoff by 
more than e by changing to a different mixed strategy. It is easy to see that any e-Nash equilibrium 
is also an e-approximate Nash equilibrium, but the opposite implication is not true in general. In 
this paper, we present algorithms for computing (the stronger notion of) e-Nash equilibria. 

Anonymous games can be extended to ones in which there is also a finite number of types of 
players, and utilities depend on how many players of each type play each of the available strategies. 
Our algorithm can be easily generalized to this framework, with the number of types multiplying 
the exponent of the running time. 

Let us conclude this section with a few more definitions. We define the total variation distance 
between two distributions P and Q supported on a finite set A as follows 



Similarly, if X and Y are two random variables ranging over a finite set, their total variation 
distance, denoted ||^ ; ^||, is defined as the total variation distance between their distributions. 
We also define the Translated Poisson distribution as follows. 

Definition 2.1 ([26]) We say that an integer random variable Y has a translated Poisson distri- 
bution with paremeters fi and cr^ and write 



if C{Y — [fi — cr^J) = Poisson{a'^ + {/i — cr^}), where {/i — a^} represents the fractional part of 



E{p^}^_,m\{x) > E{p^j^_^mI{x) + e ^ Pi = 0, 
E^p^y^^^lix) > E{p^}^_^^u\{x) + e^pi = l, 




C{Y) = TP{fi,a^) 




Finally, for a positive integer £, we denote [^] := {1, . . . 



3 Statement of Results 



3.1 A Probabilistic Lemma 



We show the following probabilistic lemma, whose proof is given in Sections HHSl 
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Theorem 3.1 Let be arbitrary probability values, pi G [0, 1] for all i = 1, . . . ,n, let {XjjJLj^ 

be independent indicator random variables such that Xi has expectation £[Xi\ = pi, and let k be 
a positive integer. Then there exists another set of probabilities {qi}f^i, Qi € [0,1], i = l,...,n, 
which satisfy the following properties: 

1. if {Yi}f^^ are independent indicator random variables such that Yi has expectation £\Yi\ = qi, 
then, 



i 

and, for all j = 1, . . . ,n 



0{l/k), 



E^^^ E^^ 



0{l/k). 



(1) 
(2) 



2. the set {qi\^^i is such that: 

(a) if Pi = then qi = 0; 

(b) one of the following is true: 

i. there exists S C [n] and some value q which is an integer multiple of such that, 

for all i ^ S, Qi £ {0, 1}, and, for all i & S, qi = q; 
a. or, there exists S C [n], \S\ < k^ such that, for all i ^ S, qi £ {0,1}, and, for all 
i € S, qi is an integer multiple of . 

3.2 Efficient PTAS 

From Theorem 13.11 our main resuh follows. Its proof is based on the following observation: if we 
replace a strategy profile that is a Nash equilibrium by the nearby strategy profile ((?i)f=i 

specified by Theorem 13. H then the change in each player's utility is bounded by the total variation 
distance between the number of players playing their second strategy in the two strategy profiles. 
It follows that the search for approximate Nash equilibria can be restricted to the strategy profiles 



of the form 2(b)i or 2(b)ii specified in Theorem 13. 1^ and this search can be done efficiently with 



dynamic programming. 

Theorem 3.2 For any e < 1, an e-Nash equilibrium of a two-strategy anonymous game with n 
players can be computed in time 

poly(n).t/. (1/6)^(1/^'), 
where U is the number of bits required to represent a payoff value of the game. 

Proof of Theorem \ 3.S^ Consider a mixed Nash equilibrium {pi, . . . ,pn) of the game. We claim 
that any mixed strategy profile {qi, . . . ,qn) satisfying Properties [U and [2] of Theorem 13.11 is a 
0(l/fc)-Nash equilibrium. Indeed, for every player i G [re] and every pure strategy s £ {1,2} 
for that player, let us track down the change in the expected utility of the player for playing 
strategy s when the distribution over {0, . . . ,n — 1} defined by the probability values {pj}j^i is 
replaced by the distribution defined by the probability values {qj}j^i. It is not hard to see that the 
absolute change is bounded by the total variation distance between the distributions of the random 
variables Ylj=/=i^j Yljj^i^j^ where {Xj}jjCi are independent Bernoulli random variables with 
expectations £[Xj] = pj, j ^ i, and {Yj}j^i are independent Bernoulli random variables with 
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expectations £[Yj] = qj, j and this total variation distance is 0{l/k) by Theorem 13. li This 

holds for every choice of i and s and, since the g^'s also satisfy Property [2a| of Theorem 13 -H we 
have that the g^'s constitute an 0(1/A;)-Nash equilibrium of the game. If we take k = 0(l/e), this 
is an e-Nash equilibrium. 

From the previous discussion it follows that there exists a mixed strategy profile {qi}i which is 
of the very special kind described by Property [2b] in the statement of Theorem 13.11 and constitutes 
an e-Nash equilibrium of the given game, if we choose k = 0(l/e). The problem is, of course, that 
we do not know such a mixed strategy profile {qi}i and we also do not know whether it is of the kind 



specified by Property 2(b)i or of the kind specified by Property 2(b)ii Moreover, we cannot afford 
to do exhaustive search over all mixed strategy profiles satisfying Property 2(b)i or 2(b)ii, since 



there is an exponential number of those. We do instead the following two searches, corresponding 



to each of the possibilities 2(b)i and 2(b)ii; one of these searches is guaranteed to find an e-Nash 
equilibrium. 



Search corresponding to 2(h)i We can guess first the cardinality m of the set S (at most n 
choices), the value q {kn + 1 choices), and the number m' of qiS in [n] \ S which are equal to 1 
(at most n choices). Then we only need to determine if there exists a set of players S C [n] and 
another set of players S' C [n] \ 5 such that, if all players in S are assigned mixed strategy g, all 
players in S' mixed strategy 1 and all players in [re] \ 5" \ S' mixed strategy 0, the resulting mixed 
strategy profile is an e-Nash equilibrium. To answer this question it is enough to solve the following 
max- flow problem. Let us define the constants 9q = n — m — m' , Og = m and 9i = m' , and let 
us consider the bipartite graph ([n], {0, q, 1}, E) with edge set E defined as follows: {i, a) € E, for 
i £ [n] and a £ {0,q, 1}, if > and a is an e-best response for player i, if the partition of the 
other players into the mixed strategies 0, q and 1 is the partition 9, with one unit subtracted from 
^ct0 Note that to define E expected payoff computations are required. By straightforward dynamic 
programming, the expected utility of a player i for playing pure strategy s G {1, 2} given the mixed 
strategies of the other players can be computed with O(n^) operations on numbers with at most 
b{n,k) := \1 + nlog2 (kn) + U)~\ bits, where U is the number of bits required to specify a payoff 
value of the gameO To conclude the construction of the max-flow instance we add a source node 
u connected to all the left hand side nodes and a sink node v connected to all the right hand side 
nodes. We set the capacity of the edge {a,v) equal to 9^^, for all a E {0, g, 1}, and the capacity of 
all other edges equal to 1. If the max-flow from u to v has value n then there is a way to select 
S and S' by looking at the edges used by the flow (details omitted). There are 0{n^k) possible 
guesses for {m,q,m'); hence, the search takes overall time 

0{{n^b{n,k) +pi{n)) -n^k) , (3) 

where pi{n) is the time needed to solve a max-flow problem on a graph with n + 5 nodes, 0(n) 
edges, and edge-weights with at most [log2 n] bits. 



Search corresponding to 2(b)ii We can guess the cardinality m of the set S (there are k^ 
choices), the number m' of qis in [n] \ S which are equal to 1 (at most n choices), and a partition of 
l^l into the integer multiples of ^ in [0, 1]; let {<Pi/k'^}i£[k'^]vj{o} be the partition. Then we only need 



^Recall that all utilities have been normalized to take values in [0, 1]. 

*For our discussion, a mixed strategy cr of a player i is an e-best response to a set of mixed strategies for the other 
players iff the expected payoff of player i for playing any pure strategy s in the support of a is no more than e worse 
than her expected payoff for playing any pure strategy s'. 

^To compute a bound on the number of bits required for the expected utility computations, note that every non- 
zero probability value that is computed along the execution of the algorithm must be an integer multiple of (■^)"~^, 
since the mixed strategies of all players are from the set {0, q, 1}. Further note that the expected utility is a weighted 
sum of (n — 1) payoff values, with U bits required to represent each value, and all weights being probabilities. 
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to determine if there exists a set of players S C [n] of cardinality m, a set of players S' C [n] \ 5 of 
cardinality m', and an assignment of mixed strategies to the players in S with (j)i/i.2 of them playing 
mixed strategy i/k'^, so that, if additionally the players in S' are assigned mixed strategy 1 and the 
players in [n] \ 5 \ 5' are assigned mixed strategy 0, then the corresponding mixed strategy profile is 



an e-Nash equilibrium. We answer this question in the same way we did for the case 2(b)i, i.e., by 
mapping the problem to a max-flow instance. Let us define the vector {^i/fc2}j£[;j2]u{o} by setting 
0j/fc2 = (/>j/fc2 for all i 7^ 0, 1, ^o/fc^ = 'Po/k^ + n — m — m' and ^^2/^2 = 4>k2/k2 + m'. The bipartite 
graph of the max-flow instance is now {[n],{i/k'^}i(z[k2][j{Qj, E) with edge set E defined as follows: 
(j, cr) G E, for j € [n] and a € {i/k'^}ie[k^]u{o}j if ^o- > and a is an e-best response for player j, 
if the partition of the other players into the mixed strategies i/k'^ is the partition 9, with one unit 
subtracted from 9^', as argued before, this computation can be carried out with 0{n?) operations 
on numbers with at most b'{n, k) := \1 + nlog(A;^) + U)] bitslfl To conclude the construction we 
include a source node u connected to all left hand side nodes with edges of capacity 1 and a sink 
node V connected to all the right hand side nodes with edges of capacity 9^/^,2, i G [k'^] U {0}. If the 
max-flow from u to v has value n then there is a way to select S, S' and (j) by looking at the edges 
used by the flow (details omitted). There are at most k^ ■ n ■ {{k + l)e)^ choices for {m,m' , 4))', 
hence, the search takes overall time 



O 



(^{n{k'^ + l)n^b'{n,k) +p2{n,k)) ■ nk^{{k + l)ef^^ , (4) 



where P2{n, k) is the time needed to solve a max-flow problem on a graph with n + k"^ + ?> nodes, 
at most (n -|- 1)(A;^ -|- 2) — 1 edges, and edge- weights with at most |'log2 n] bits. 
Prom ([3]) and (jH), it follows that the overall required time is at most 

poly(n)-C/-(l/e)^(^/^'). 



4 Overview of the Proof of Theorem 13.11 

We employ a hybrid argument. In particular, we define first a set of probability values {p^}ig[n] 
and a corresponding set of independent Bernoulli random variables {-^jjigin] with expectations 
£[Zi] = p., for all i G [n], such that 



and, moreover, for all j = 1, . . . , n. 



0{l/k), 



0{l/k). 



(5) 



(6) 



The set of probability values {p'i}i(z\n\ does not necessarily satisfy Property 2(b)i or 2(b)ii in the 



statement of Theorem 13. 1|, but will allow us to define the set of probabilities {(li}i^[n] which does 



®The bound on the number of bits follows from the fact that every non-zero probability value that is computed 



along the execution of the algorithm must be an integer multiple of (t 



since the mixed strategies of all players 



are from the set {i/k }ig[fe2]u{o}- Further, note that the expected utility is a weighted sum of (n — 1) payoff values, 
with U bits required to represent each value, and all weights being probabilities which are multiples of (-p-)"^^. 
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satisfy Property 2(b)i or 2(b)ii and, moreover 



0{l/k), 



and, for all j = 1, 



,n. 



E^^= E^^ 



0(1A). 



(7) 



(8) 



By the triangle inequality, ([5]) and ^ imply ([T]) , and ([6]) and ([8]) imply ([2]) . Let us call Stage 1 the 
process of determining the p'^s and Stage 2 the process of determining the qis. The two stages are 
described briefly below, and in detail in Sections [5] and [6] respectively. 

Stage 1: The goal of this stage is to eliminate any probability value pi falling in the set 7^ := 
(0, ^) U (1 — 1), that is, any pi that is either too small, but non-zero, or too large, but not one. 
To remove the small piS we round some of them to and some of them to 1/k in such a way that 
their sum changes by at most 1/k. Similarly, we round some of the large piS to 1 — 1/k and some 
of them to 1, so that their sum changes by at most 1/k. Finally, we leave the piS falling outside Tk 
unchanged. Our work from [I4j implies that the set of probability values {p'j}i thus defined satisfies 
([5]) and See details in Section [5l 

Stage 2: The definition of the set {qi}i depends on the number m of p'^s which are different than 



and 1. The case m > k^ corresponds to the Case 2(b)i in the statement of Theorem 13.11 and the 

if p'^ G {0, 1}; and here is 



case m < k^ corresponds to the Case 2(b)ii, In both cases we set q. 
how we round the p^'s from the index set A4 := {i \ p'^ ^ {0, 1}}: 



P'v 



Case m > k^: Using results from the literature on Stein's method, we show that the sum of 
Bernoulli random variables with expectations p^, i € A1, can be approximated by a Binomial 
distribution B{m',q), where m' < m and q is an integer multiple of In particular, we 



show that an appropriate choice of m' and q implies ([7|) and 
the index set A4 equal to q and the remaining equal to 0. 



if we set m' of the g^'s from 



• Case m < k^: The Binomial approximation may not be tight enough for small values of m. 
To remedy this, we follow our rounding scheme from [14] . That is, we delicately round the 
p-'s to nearby multiples of p- so that their sum X^igA^Pi is approximated to within l/k"^ by 
the sum J2ieM results from [l^ imply that ([7]) and ^ hold in this case. 



5 Details of Stage 1 

We describe the definition of the set {p'j}i. For concreteness, let 

C := {i\i e [n] Api e {0,1/k)} and Ti := {i \ i £ [n] A pi e {1 - l/k,l)} . 

We set p[ = Pi, for alH € [n]\C[JTC; that is, we leave the probabilities pi falling outside the set 
unchanged. It follows that 



E ' E 

ieln]\CUH ie[n]\CUH 



0, 



and, for all j S [n] \ £ U TC, 



E ^» ' E 

i€[n]\{CUH)\{j} iG[n]\{CUH)\{j} 



0. 



(9) 
(10) 
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5£ 



To round the probabilities pi, i € C, we use the following procedure: 

1. Let Sc := J2iecPi'^ ^ 

2. Let C Q Che an arbitrary subset of C with cardinality \C'\ = m. 

3. Set p'- = I, for all i G and p'^ = 0, for all ieC\ C. 

An application of Lemma 3.9 from [H] with a = 1 implies immediately that 

3 



iec 



< 



and, for all j £C, 



^ Xi; ^ Zi 

iec\{j} iec\{j} 



< 



6 



(11) 
(12) 



where we used that 



EiecPi-EiecPi < i and, for ah j G Ei€C\{j}Pi - EieC\{j}Pi 



< 



k- 



We follow a similar rounding scheme for the probabilities pi, i (z Ti; that is, we round some to 
1 — 1/A; and some to 1 in such a way that their sum is preserved to within 1/k. As a result, we get 
(to see this, repeat the argument we employed above to the variables 1 — Xi and 1 — Zj, i G TY) 



^Xi ; ^Z, 



3 



and, for all j GTI, 



^ Xi] ^ Zi 

ieH\{j} ieH\{j} 



< 



(13) 
(14) 



Using P, dlO]), (dH), ([12]), ([ni), dll]) and the couphng lemma we get ([5]) and 



6 Details of Stage 2 

Recall that Ai := {i \ p'^ ^ {0, 1}} and m := \M.\. The definition of the probability values {qi}i will 
depend on whether m > k"^ or m < k"^ . In particular, the case m > k^ will correspond to the Case 



2(b)i in the statement of Theorem 13 -H and the case m < k^ will correspond to the Case 2(b)ii In 
both cases we set 

Qi = p[, for all i G [n] \ A4. 

It follows that 



ie[n]\M ie[n]\M 



and, for all j G [n]\A^, 



E E 

ie[n]\M\{j} ie[n]\M\{j} 



0. 



(15) 
(16) 
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6.1 The Case m > 



We show that the random variable X^jg^vi -^i is within total variation distance 0(1/ A;) from a 
Binomial distribution B{m',q) with 



m : = 



SigTM Pi 



and q := - — , 

kn 



where T G {0, . . . , fcn} satisfies e [£, 



In particular, let us choose an arbitrary subset M' C JH with cardinality m' (note that m' < mhy 
the Cauchy-Schwarz inequality) and let us set 

Qi = q, for all i G M.' , 
qi = 0, for alH G M \ M' . 

We shall now compare the distributions of the random variables ^i find "l^ieM ^^^^ 

let us set 



H := £ 



:= Var 



and ^' := £^ 



ieM 



and o"'^ := Var 



1 



The following lemma compares the values n, fi' , a, a' . 
Lemma 6.1 The following hold 

\cT"-a'\<l + j, 
k 



(17) 

(18) 
(19) 

(20) 



The proof of Lemma EH is given in the appendix. To compare '^ZieM ■^^ YlieM approxi- 
mate both by Translated Poisson distributions. To do this, we make use of the following theorem, 
due to Rollin [26l. 



Theorem 6.2 ( [26] ) Let Ji, . . . , J„ he a sequence of independent random indicators with £[Ji] 
Pi . Then 



i=l 



< 



Y.7=lPii'^-Pi) 



where fi = Y17=iPi (^nd a"^ = YA=iPi{l -Pi)- 
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Theorem 16.21 implies that 



ieM 



< 



< 



EieMP'ii^-p'i) 
1 



EteMP'ii^-Pd 



+ 



^ ^ 0{l/k). (using (HOD) 



+ 



ky'l^T/k (1 - i) 



Similarly, 



1 



/2^ 



1 2 

< ^ + 



< 



+ 



/2 



1 1 



0{l/k). (using (USD, ([50])) 



By the triangle inequality we then have that 



< 



/2n 



+ rP(/.,a^) ; rP(^',a' 



0{l/k) + \\TP{fi,a') ■ TP{f,',a'^)\\. 



(21) 



It remains to bound the total variation distance between the two Translated Poisson distributions. 
We make use of the following lemma. 

Lemma 6.3 ([2]) Let ^1,^2 G and al,a2 G M+ \ {0} be such that [^1 — af] < [^2 — o"!] • Then 



\TP{^il,al)-TPi^,2,al)\ \ < 



1/^1 -/W2i , |o-f-0-|j + l 



0-1 



+ 



0-t 



Lemma 16.31 implies 

\\TP{fx,a^) ; TP(^',a'2)|| < 
1/A; 



\a'-a'^\ + l 



< 



min((T, fj') min((T^,o"'^ 
2 + 3/k 



k. l-i 



1 1 



A; fc^ fc^ 



fc2 1 - i 



1 1 



(using Lemma l6.ip 



k k^) 



= 0{l/k'). 
Using ^ and (l22|) we get 



E^- E^^ 



0{l/k). 



(22) 



(23) 



We claim that the following is also true (see proof in the appendix). 
Lemma 6.4 For all j A4: 



E E 

ieM\{j} ieM\{j} 



0{l/k). 



(24) 
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6.2 The Case m < 

Theorem 3.1 of [Hj implies that there exists a set of probability values {qi}i^M: such that 
— Qi is an integer multiple of —77 , for alH G A1 ; 



and, for all j G 



0(1A); 



E E 

ieA4\{j} ie7H\{i} 



0{l/k). 



(25) 
(26) 



6.3 Concluding Stage 2 

In the case m > considered in Section [6.H a set of probability values {qi}i was defined which 



satisfied Property 2(b)i in the statement of Theorem 13. li In the case m < k^ considered in 



Section [6^ the resulting set {qi}i satisfied Property 2(b)ii Moreover, in both cases, the following 
were satisfied 



E^- E^« 



0{l/k) 



E E 

ie[n]\A^ ie[n]\M 



(27) 
(28) 



and. 



for all j ^ M, 



E E 

iGA4\{j} i&M\{j} 



0(1A), 



for all i G [n]\7V(, 



E E 

ie[n]\M\{j} ie[n]\M\{j} 



0. 



(29) 

(30) 



Using ([271), dMl), (129]), ([30]) and the couphng lemma, we get ([7]) and ([8]). 
Acknowledgment: We thank Christos Papadimitriou for helpful discussions. 
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APPENDIX 

A Missing Proofs 

Proof of lemma \6.1[ We have 

Y^ieM Pi L < g = ^ < ^jeA4 Pi 



Hence, 



and 



Moreover, 



f \ ^ / f \ / \ ^ I rn! 1 

i&M i€M ieM 

II- II > ypi-m = 0. 

^-^ m' 

ieM 

H=^p'i>m^>k^, 

i&M 



since ni > k^. For convenience let us denote A2 := ^jg^viPi^- Then 

= ^^PiC^-p'i) =/^-A2, 



i€M 



and a'^ = ^ qi{l - qi) = fi' - qf = n' - m'q^. 



ieM i£M 



Hence, 

Now note that 



a — a — '■■ 



- /i') + {m'q^ - A2) . 



Therefore, 



^ ^ieMPi) ^ I ^ {YieM Pi) _L 1 _ /f_ I 1 

A2 Z^ieMPi l^ieMPi -^2 

^ /"^ 1 2/x ^ 2 ^ /^^ 

m'2 (A;n)2 m'kn — ^ — j^'2 



< — + T-r-TTT - — < "I g < — < — = A2. 



_l_ 1 kn m' (kn)'^ kn m' !±_ 
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which imphes 



|mV-A2| < 



2li 



2m Xi 2 



Hence, 



Finally, 



1 2 



- tr'^l < l/i - + [mV - A2I < + + 1 < 1 + 



Proof of lemma \6.4\' Let us fix some j (z A4 and denote 



fi-i := £ 



aij := Var 



ieA4\{j} 



and := f 



ieA^\{j} 



and 0"^^ := Var 



ie-M\{i} 



lieM\{j} 

Using Lemma 16.11 it is easy to see that fi-j , fi'_j , a^j , a'_j satisfy the following. 
Lemma A.l 



Ifi^j- fi'A < 1 + 



\a'^j -crljl < 1.25 + 



fJ—j > k — 1, 



l\ 1 



(31) 

(32) 
(33) 

Proof of Lemma \A.l[ Note that fi_j = IJ' — p'j and fi'_j = fi' — qj. Hence, using Lemma |6. II we have 

^-j > k'^ - I and - <\^i- jJ.'] + \p'j - qj\ < ^ + l- 
Similarly, = — p'j{l — p'j) and a'^j = o"'^ — qj{l — qj). Hence, using Lemma ET] we have 



- \' kj 4 

.'2 _2 I /I /2 _2| , 1/ , 



and - atj\ <\a'^ - + \p'j{l-p'j) - qjil - qj)\ < 1.25 + -. 



Theorem 16.21 implies that 



ieM\{j} 



1 2 

< + — < 

= 0(1/A;). 



+ 



fc2 (1 



(using 



(35) 
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Similarly, 



ieM\{j} 



12 



1 2 

^ — + — 



< 



+ 



/fc2 (i_ 1) _i.5_ 3 A:2(l-i)-1.5 

= o(iA). 

Finally, from Lemma 16.31 and Lemma lA.ll we have 



(using 



-.1 ^I'^-.-^^^-l + l 



min((T_j- , cj_j- ) min((T_j , a': 



< 



1 + i 



+ 



2.25 + 



= o(iA). 

By the triangle inequality and (f35]) . (f37|) . ([38l) we get that 



ieA4\{i} i6A1\{j} 



o(iA). 



(36) 



(37) 



(38) 
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